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Abstract. The state of the art in local search for the Traveling Salesman 
Problem is dominated by ejection chain methods utilising the Stem-and- 
Cycle reference structure. Though effective such algorithms employ very 
little information in their successor selection strategy, typically seeking 
only to minimise the cost of a move. We propose an alternative approach 
inspired from the AI literature and show how an admissible heuristic can 
be used to guide successor selection. We undertake an empirical analysis 
and demonstrate that this technique often produces better results than 
less informed strategies albeit at the cost of miming in higher polynomial 
time. 

1 Introduction 

Simple to state yet difficult to solve, the Traveling Salesman Problem (TSP) is 
one of the oldest and most widely studied in the field of combinatorial optimi- 
sation. The canonical (or symmetric) TSP requires finding a minimum cost tour 
of all nodes in a graph G — {V, E) where V — {wi, . . . , w„} is the set of all nodes 
and E — {{vi,Vj)\vi,Vj €V,i ^ j} is a set of edges connecting them such that 
cost (or weight) of each edge Cij = Cji and is non- negative for all {vi,Vj) G E. 

Despite receiving much attention the TSP has resisted attempts at developing 
a polynomial time algorithm to solve it exactly; a situation which is unlikely to 
change unless P = NP [7]. In response many researchers began focusing their 
attention on local search methods in order to produce approximate solutions in 
low polynomial time. By far the most successful of these efforts is the seminal Lin- 
Kernighan algorithm , variations on which have dominated the area for almost 
three decades. Recently however it has been shown that alternative techniques, 
using the Subpath Ejection Chain (or SEC) procedure, are able to compete with 
and in many cases outperform Lin-Kernighan |11I4I2| . 

SEC proceeds by identifying the Stem- and- Cycle reference structure; a span- 
ning subgraph of G which is similar to but not quite a tour (yet from which a 
tour can always be immediately generated). Using special transition rules one 
configuration of the reference structure is transformed into another such that 
a compound neighbourhood of reachable tours is created and from this neigh- 
bourhood a cost improving tour is selected. The power of the technique can 



2 Daniel Harabor and Philip Kilby 

be explained, in part, by a connectivity result due to Glover [3] which shows 
that the neighbourhood generated using Stem-and-Cycle is a super-set of that 
generated by Lin-Kernighan. 

At the heart of the SEC procedure is a simple policy which extends the 
ejection chain by always choosing to expand the neighbouring configuration 
with minimum transition cost |3j. While effective it has been suggested that 
the nearest-neighbour strategy fails to exploit the geometric structure of the 
TSP and tends to perform poorly on problem instances where nodes appear in 
clusters [1]. In the AI literature such difficulties are often overcome by extract- 
ing knowledge from the problem domain. Known as informed heuristics these 
methods estimate the cost of all remaining steps required to solve the problem. 
This is in stark contrast to the nearest-neighbour strategy which only considers 
one step at a time. 

We develop an alternative move operator which uses an informed heuristic to 
rank neighbouring configurations according to their "distance" from an apriori 
unknown target tour. We compute a 1-tree lower-bound [S] on the cost of the 
target tour and work on the thesis that this heuristic will guide our algorithm 
to neighbourhoods which contain lower cost tours and in relatively fewer steps 
than would otherwise be reachable using SEC. 

Our motivation is to show that on many smaller TSP instances ejection 
chains guided by informed heuristics are able to achieve better results than al- 
ternatives based on the nearest-neighbour strategy. Smaller TSPs are of interest 
because they often appear in many other areas of combinatorial optimisation. 
For example, in the Vehicle Routing Problem the number of requests which may 
be assigned to any one route is necessarily small (usually less than 100) due to 
physical limits on the capacity and range of each truck. 

2 Subpath Ejection Chains using Stem-and-Cycle 

The Stem-And-Cycle reference structure, as shown in Figure [ija), is a spanning 
subgraph of G that consists of a set of nodes ST — {ut,...,u,.} forming a 
simple path called the stem which is connected to another set of nodes CY = 
{vr, Vs-^ , ■ ■ ■ , J Vr} termed the cycle. The node connecting the stem to the cycle, 
Vr , is called the root while the node at the other end of the stem, vt , is known as 
the tip. Vr must always be connected to at least two unique nodes on the cycle 

and which are respectively designated the left and right subroots. 

The main idea behind Stem-and-Cycle (whose theoretical underpinnings may 
be found in [3]) is to search for a cost improving tour by re-arranging subpaths 
within the reference structure. This is achieved by defining a series of transition 
rules, or moves, that transform one Stem-and-Cycle configuration into another. 
Each move adds an edge {vt,Vi) that connects the tip to a node on the stem 
or cycle and deletes an adjacent edge {vi,Vj) such that the resulting spanning 
graph remains connected. When vt ~ v,. the structure is said to be degenerate 
and forms a valid tour. Further, although the Stem-and-Cycle structure is not 
itself a feasible solution to the TSP it is always possible to generate two such 
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tours or trial solutions^ by adding an edge {vt,Vs-^) or (vt^Vs^) and deleting the 
edge {vs-i^,Vr) or (vs^jVr) as appropriate. 



Fig. 1. (a) The Stem-and-Cycle reference structure. Node c is the tip, node a 
the root and nodes d and g the left and right subroots respectively, (b). SEC 
begins at tour S and creates a 4-level ejection chain (each level is denoted by a 
gray node) before choosing G as the best reachable trial solution (TS). 

The Stcm-and-Cycle reference structure is used as the basis for the Subpath 
Ejection Chain (or SEC) move operator. We briefly outline it here due to its 
central importance in developing other ideas in this paper; a full description, 
including pseudocode, is given in 

In broad terms SEC explores a neighbourhood by producing a series of inter- 
related moves that generate one Stem-and-Cycle configuration from another in 
a chain-like fashion. Each move constitutes a new level of the ejection chain and 
at each level it is necessary to evaluate the set of all legitimate Stem-and-Cycle 
configurations in order to select one which will further extend the chain. When 
the ejection chain has reached a maximal size the best trial solution reachable 
from any of its levels is selected as the new start tour. The process is repeated 
until SEC reaches a local minimum where no cost improving tour can be found. 
Figure fljb) serves to illustrate this idea. 




In order to guide SEC toward promising regions of the search space two 
important constraints are applied: 

1. Each added edge is marked interchangeably as either black or white. White 
edges are assumed to belong to a target tour into which we seek to transform 
the starting tour. Thus, white edges, once added, cannot be subsequently 
deleted. Black edges are assumed to not belong to the target tour and may 
be deleted as part of a future move. 

2. Each deleted edge is marked "tabu" and cannot appear in a subsequent level 
of the ejection chain. 

The first constraint seeks to focus the search toward an apriori unknown 
target tour which may or may not be cost improving. It also serves to limit 
the length of the ejection chain to a maximum of 2n levels, where n = \V\. 




(a) 



(b) 
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The second constraint prevents the search from cyclical backtracking. Any move 
which violates one or both constraints is deemed to be illegitimate and not 
considered. 



3 Growing Ejection Chains Using Heuristic Search 

The SEC algorithm searches for cost improving tours by creating a series of 
multi-level ejection chains. At each level of a chain the algorithm generates a 
set r of up to 2n legitimate Stem-and-Cyclc configurations [3]. Each x £ F is 
evaluated using NN, a nearest-neighbour heuristic function modeled as: 

NN{x) = ej ectionV alue -\- min[trialV alue) 

The first term, ejectionValue, refers to the cost of the move generating x 
from the current Stem-and-Cycle configuration. The second term, min(trialV alue) , 
is the minimum cost associated with generating one of two the trial solutions 
reachable from x. Since each NN(x)\x S F requires constant time to evaluate 
a maximally sized ejection chain with up to 2n levels may be constructed in no 
more than 0(4n^) time. 

Although SEC runs very fast its nearest-neighbour heuristic is not very accu- 
rate. In particular, there is a risk that SEC will choose a successor which appears 
promising but ultimately leads to a section of the search space that contains no 
cost improving tour. To address this issue we propose an alternative move oper- 
ator that is based, in part, on a well known algorithm from the heuristic search 
literature: A* |5j. Developed in the context of the single-source shortest-path 
problem, A* makes use of an informed heuristic function that evaluates nodes 
in the search space by computing a lower-bound on the remaining distance to 
reach a known goal node. In the canonical implementation the final or f-cost 
associated with an arbitrary node x is given by the function: 

fix) = gix) + h{x) 

where h is the heuristic cost and g is the weight of the path from the start node to 
X. Provided h is admissible (i.e. it never overestimates the distance to the goal) 
this approach is guaranteed to find a shortest path to the goal if one exists. 
Applying such an algorithm to solve the TSP is not straightforward however. 
In particular, it is difficult to measure the "distance" between an arbitrary tour 
and the optimal tour as the latter is not known apriori. Another difficulty arises 
from the fact that A* is a systematic tree-search algorithm; when applied to 
the TSP it exhibits exponential space complexity requirements which make it 
infeasible for solving all but the most trivial instances |10|9) . In the remainder 
of this section we will address each of these problems in turn and in the process 
develop a new Stem-and-Cycle move operator that constructs ejection chains 
with the help of a an informed heuristic function. 
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3.1 Problem Formulation 

We begin by recalling that every ejection chain created by the Stem-and-Cycle 
move operator must respect a set of edge exclusion constraints (namely those 
edges which have been previously deleted and cannot be re-added) and a set of 
edge inclusion constraints (namely those edges marked as white which cannot 
be deleted). Combined, these constraints guide the ejection chain toward a par- 
ticular but apriori unknown tour. We designate this tour g and the associated 
start tour, from which the algorithm begins, s. Let these tours correspond to 
A*'s goal node and start node respectively. 

We define our search space as an undirected meta-graph Gm = {Vm,Ejn) 
where Vm represents the set of nodes in the graph and Em the set of edges. Let 
each node x € Vm represent a valid Stem-and-Cycle configuration. We will say 
that an edge {x, y) € Em exists between two nodes, x and y, if the Stem-and- 
Cycle move operator can induce y from x (or vice versa) in a single move. The 
weight of each edge is thus equal to the cost of the associated Stem-and-Cycle 
move that induces one of its endpoints from the other. Under this formulation 
s,g G Vm and our goal is to find a path HI^jS) connecting them such that the 
weight of g is minimised: 

J|(s,5) : min{g) 

The existence of the path is guaranteed by a result due to Glover [3] which 
shows that Stem-and-Cycle can generate a tour from any other using not more 
than 2n moves. 

3.2 Modeling The Cost Function 

Recall that A* ranks nodes by computing a final value which is the sum of two 
cost functions: g and h. We proceed by developing TSP-specific analogues for 
each of these. 

First we consider the problem of calculating a g-cost which requires exactly 
measuring the distance from s to some candidate successor node c S Vm- To 
achieve this we can simply take the sum of the weights of all edges designated 
white appearing in c. We reason that a node which contains more white edges 
is in some sense "closer" to g than one containing fewer white edges. 

Meanwhile, calculating a h-cost is analogous to computing a lower-bound on 
the cost of all white edges which have yet to be added in order to transform 
c into g. We accomplish this by taking the weight of all non-white edges in a 
constrained 1-tree. 

Conceptually similar to the canonical 1-tree |6j (the construction of which we 
outline in Figure[2]) the constrained 1-tree differs in that it contains all edges from 
a given include set and none of the edges from a given exclude set. Algorithm [T] 
describes the steps necessary to create this data structure. Note that this idea 
was originally suggested by Pohl jTUj but not described in any detail. As such 
we believe this work is the first elaborate study of the constrained 1-tree. 
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Fig. 2. (a) Begin the 1-tree by first constructing an MST of all tour nodes 
except one (here, node i). (b) Complete the 1-tree by connecting node i to the 
rest of the spanning tree via the two smallest weight edges incident with it (here, 
edges (z, a) and (i,e)). 

Our implementation takes a set of nodes V — {vi, . . . , w„} and proceeds by 
creating a MST on the subset of nodes {v2, ■ ■ ■ , However we augment the 
basic MST construction algorithm of Prim (as described in [1]) such that it 
respects all edge constraints to which the move operator is subject. We make 
two changes: 

1. In lines 19-21 of Algorithmjl] white edges are added to the heap are given the 
highest priority. This ensures that all such edges are immediately selected 
for inclusion into the MST. 

2. In lines 23-25 of Algorithm [T] before any non- white edge is added to the heap 
we make sure it is not on the excluded edges list. If it is, we immediately 
discard it from further consideration. 

We also apply the following rules at line 16 of Algorithm [l] (where we connect 
the excluded node vi to the MST and complete the 1-tree): 

1. Any white edge incident with node vi which appears in the included edges 
list is automatically selected, regardless of cost. 

2. Any non-white edge incident with node vi which appears in the excluded 
edges list is automatically discarded, regardless of cost. 

3. The remaining edges incident with node Vi are ranked by their weight. From 
this set we always select the one with smallest weight. 

3.3 On The Admissibility Of The Constrained 1-Tree Heuristic 

The canonical 1-tree always lower-bounds the optimal tour and thus the the 
goal configuration g. However, it is not immediate that the constrained 1-tree, 
as constructed by Algorithm 1, is also a lower-bound on g. In particular we 
would like to show that the heuristic function h, which is based on the sum cost 
of non-white edges in a constrained 1-tree, is a lower-bound on the sum cost of 
any as-yet-unknown set of white edges which appear in the goal tour g. This 
property is required to ensure the admissibility of the heuristic function h. 
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Algorithm 1 Constrained 1-Tree 

Require: G — {V,E), ExcludedEdges, IncludedEdges 

1: vi -s-getNodeAtIndex(y,0) 

2: mstroot -h- getNodeAtIndex(V, 1) 

3: Neighbours V — {v\ U mstroot} 

4: Heap initHeap(A''ei(7ft6o«rs) 

5: processNeighbours(msfroot) 

6: wliile Neighbours 7^ do 

7: n -^^ min^Heap) 

8: if parent(n) = or n = tlien 

9: return 

10: if makeEdge (parent (p), n) G ExcludedEdges then 

11: return 

12: parent(n).addChiId(n) 

13: Neighbours {Neighbours — n} 

14: processNeighbours(n) 

15: return makelTree{mstroot, vi) 

procedure processNeighbours(n) 

16: for all x G Neighbours do 

17: e makeEdge(n, x) 

18: if e G IncludedEdges then 

19: Heap, priority (a;, 0) 

20: parent(a;) ^ n 

21: else 

22: if e ^ ExcludedEdges and priority(a;) < cost(e) then 

23: Heap.priority(a::, cost(e)) 

24: parent (a;) n 



Lemma 1. For all c G Vm it is always possible to construct a constrained 1-tree. 

Proof. To prove this claim it is sufficient to observe that every Stem-and-Cycle 
configuration, including the optimal tour, is itself a 1-tree. □ 

Lemma 2. The constrained minimum spanning tree associated with an arbitrary 
node c G Vm is a lower-bound on the goal configuration g. 

Proof. We proceed with a structural proof by induction on the spanning tree 
constructed for c. First we will analyse two distinct possibilities for the base 
case: c — s or c — g. We then proceed with an analysis of the general case in 
which c is subject to an arbitrary number of edge inclusion and edge exclusion 
constraints. In each case we will show that Algorithm [T] produces a spanning 
tree which is a lower-bound on g. 

Base Case 1: c = s. In this case there are no edge inclusion or exclusion con- 
straints. Algorithm [T] mirrors the behaviour of Prim's algorithm and thus 
creates an unconstrained spanning tree of minimum weight. Clearly such a 
structure is a lower-bound on g. 
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Base Case 2: c ^ g. In this case every edge in c is white and appears in the 
set of edges that must be included. Algorithm [T] constructs the constrained 
MST by looking for minimum cost edges that connect a leaf of the spanning 
tree to some other node that is not yet in the tree. If any edge from this 
candidate set is white it is assigned the highest priority which guarantees 
it will be selected the next time the spanning tree is extended. Since every 
node in c is incident with at least one white edge it follows that each node is 
added to the spanning tree by way of a white edge. As the constrained MST 
contains only white edges its weight must be a lower-bound on g. 

Inductive Case: c contains some number of white and non-white edges. We 
know from Base Case 2 that every white edge appearing in g also appears 
in the constrained MST associated with c. It remains to show that the set 
of non-white edges appearing in the constrained MST is minimum. 
When considering a non- white edge Algorithm [l] will rank candidates that 
have not been explicitly excluded by their weight and always select for in- 
clusion into the spanning tree the edge with smallest weight. Consequently, 
no edges which have been deleted can appear in the spanning tree and of 
the remaining candidate edges only those edges of least cost which do not 
induce a cycle are selected when extending the tree. The set of non-white 
edges included in the spanning tree is thus minimum and must be a lower- 
bound on the weight of any white edges which later appear in g. Thus the 
weight of the constrained MST is a lower-bound on g. 

□ 

Theorem 1. The weight of a constrained 1-tree associated with some arbitrary 
node c G Vm is a lower-bound on the weight of the goal configuration g. 

Proof. We recall that a constrained 1-tree on the set of nodes {vi, . . . ,Vn} G c 
is a two-step process requiring the union of a constrained MST on the nodes 
{v2, ■ .. ,Vn} with the two least-cost (possibly white) valid edges incident with 
vi. We know from Lemma [2] that the weight of the constrained MST is a lower- 
bound on g. It remains to show that the two edges connecting vi to the spanning 
tree are a lower-bound on the cost of two edges incident with vi in the goal 
configuration g. 

This operation follows the edge selection policy outhned in Section [3^ i.e. we 
preference white edges above all others, regardless of cost, disconsider excluded 
edges and rank the remaining edges by weight. Thus the two edges selected to 
connect vi to the constrained MST must be a lower-bound on the cost of the two 
edges incident with vi in g. From this we conclude that the constrained 1-tree 
constructed by Algorithm [T] is itself a lower-bound on g. □ 

Theorem [T] is sufficient to show that a /i is admissible. Furthermore, each 
generated node evaluated by h will be subject to a larger set of edge constraints 
than its parent. From this it is readily shown that h is also monotonic and thus 
guaranteed to be non-increasing as the search approaches the target tour g. 
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3.4 The Informed Subpath Ejection Chain Algorithm 

We have combined the Stem-and-Cycle reference structure with the constrained 
1-tree lower-bound in order to develop a TSP-specific analogue of A*. If we apply 
the canonical form of the algorithm to Gm, always expanding the node with the 
least lower-bound, we are guaranteed to find the optimal tour. However, this 
operation is likely to be computationally intractable unless our heuristic function 
h is able to compute a near-perfect estimate of the remaining distance to reach 
the optimal tour. If this is not the case A* will expand an exponential number 
of nodes [5]. 

Since we are interested in an approach that is guaranteed to run in low poly- 
nomial time we modify the canonical A* algorithm by reducing the branching 
factor of each node to 1. This makes our new algorithm analogous to SEC |3|11) 
however instead of always choosing the neighbour with minimum transition cost 
as the next level of the ejection chain we instead choose the neighbour which 
minimises the cost of our evaluation function /. We term the resulting technique 
Informed SEC (or ISEC for short) and outline it in Algorithm [2j 



Algorithm 2 Informed Subpath Ejection Chain Algorithm (ISEC) 
Require: s 

1: next <— s 

2: hestTour minTrialSolution(s) 
3: while next 7^ do 

4: if minTrialSolution(nea;f) < bestTour then 

5: bestTour minTrialSolution(nea;f); 

6: Fnext expand (nest); 

7: next 

8: for all x £ r^ext do 

9: if next — then 

10: next <— x 

11: else 

12: if f(a;) < i{next) then 

13: next x 

14: return bestTour 



4 Complexity 

We analyse the running time of the ISEC algorithm in terms of node expansions. 
When a node x S Gm is expanded we generate the set which contains all 
legitimate Stem-and-Cycle configurations reachable in a single move. We then 
select min^r^) as the next node to be expanded and the process continues until 
a maximal size ejection chain has been built. Since every second move adds to 
X a white edge which cannot be deleted the maximum number of nodes that 
ISEC can expand per ejection chain is 2n. Additionally, the size of is itself 
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not more than 2n which gives us an 0(4n^) bound on the total number of nodes 
that must be generated per ejection chain. 

For each node y G F^we call Algorithm[l]to create a constrained 1-tree. Using 
a derivative of Prim's MST algorithm we first compute a constrained MST on 
the nodes {v2, . . . , w„} S V. This is an 0{{n — 1)^ x log2{n — 1)) operation on a 
complete graph such as the Symmetric TSP. To finish the 1-tree we connect node 
vi G V to the constrained MST via its two least-cost edges. However, we have 
to exclude from consideration any edges which have been previously deleted and 
prioritise any edges which have been marked white. Finding two edges which 
satisfy these constraints is an 0{n— 1) operation, even if we pre-sort the the set 
of edges incident with vi . Thus the total time required to compute a constrained 
1-tree is 0((n — 1)^ x log2(n — 1) + {n — 1)) 

If we assume that each ejection chain constructed by ISEC will require a 
maximal number of node expansions then the total running time of the algorithm 
is 0(4n^ X [{n — 1)^ x log2(?i — 1) + {n — 1)]). We may roughly approximate this 
as 0{4n^ x log2{n) -I- 4n"^). 

Clearly the performance of ISEC is limited by the time required to compute a 
constrained 1-tree. However, it is not strictly necessary to compute a new 1-tree 
for each generated node. Consider the following argument: 

Let p, c S Vm be two Stem-and-Cycle configurations such that c is generated 
as a result of expanding p. Such a move adds edge Cq and deletes the edge e^. 
Next, let Tp and Tc be the associated constrained 1-trees which are constructed 
to evaluate p and c respectively. We proceed with a discussion on the effect that 
the Stem-and-Cycle move generating c from p has on the structure of Tc with 
respect to Tp. In particular, we identify four distinct types of changes which may 
occur: 

Change 1: Ca ^ Tp and ^ Tp. In this case two possibilities exist, depending on 
whether Cq is white or not. 

i) If we suppose is not white then we are not forced to include in 
Tc any edges which were not already in Tp. Further, since Cd ^ Tp 
we do not have prevent any edges appearing in Tc which were 
present in Tp. Thus, Tp — Tc and we do not have to generate a 
separate 1-tree for c. 

ii) If we suppose Ca is white then we are forced to include it in 
Tc. However as ^ Tp we cannot re-use Tp in place of Tc and 
still retain the admissibility property of the heuristic function h. 
Thus we must generate Tc from scratch. 

Change 2: Ca ^ Tp and ed € Tp. In this case the deletion of Cd divides Tp into 
two separate subtrees which must be re-connected with a least-cost 
edge that has not been previously deleted. Finding such an edge is an 
operation requiring O(n^) time and thus asymptotically equivalent 
with computing tc from scratch. The status of Ca (black or white) is 
of no consequence. 

Change 3: Ca & Tp and ^ Tp In this case two possibilities exist, depending on 
whether Ca is white or not. 
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i) If Ca is a white edge its inclusion into Tc is forced. Consequently, 
Algorithm [l] will produce a 1-tree such that cost(Tc) > costirp). 
However, as G Tp it must be that Tp respects the same set 
of constraints to which Tc is subject. Thus, Tp is also a valid 
lower-bound for c and we may use it in favour of generating Tc- 

ii) If Ca is a black edge the same argument as before holds; Tp is a 
direct lower-bound for c and we may use it in favour of generating 

Tc- 

Change 4: € Tp and g Tp. The same argument which we used for Change|2] 
is also applicable here. The deletion of divides Tp into two separate 
subtrees which must be re-connected with a least cost-edge. This 
operation is asymptotically equivalent to generating Tc from scratch. 
The status of Cq (black or white) is of no consequence. 

These results are interesting because they show that in half of all node gen- 
eration scenarios it is not necessary to compute a 1-trec. In spite of this the 
theoretical running time of ISEC remains unchanged and in the worst case we 
still need to compute a constrained 1-tree for every node generated. We have 
observed that in practice this scenario is unlikely; as our experimental results 
will show a speedup of 50% is not uncommon. There are however some theoret- 
ical side-effects which merit discussion. In particular, notice that in Change [3i] 
although Tc can be shown to obey the same set of constraints as Tp its cost may 
infact be higher. In such cases the accuracy of the heuristic function h is dimin- 
ished as not all constraints were taken into consideration when computing h[c). 
We explore this issue empirically and show that in practice it is not a significant 
drawback. 

5 Experimental Setup 

We implemented and tested SEC (as described in [II]), ISEC and fast-ISEC 
(a variant of ISEC using the speedup technique described in Section |4]) on 30 
small-to-medium problems from TSPLIB [T^. These ranged in size from 29 to 
200 cities. We generate 10 experiments per problem by randomly perturbing the 
initial order of cities. We evaluate the performance of both algorithms on each 
of these 10 instances making for a total of 600 distinct experiments. All tests 
were undertaken on an Intel Core2Duo machine with 2GB of RAM running OSX 
10.5.7. 

Our main objective is to compare the relative quality of successor nodes 
chosen during a neighbourhood search. To this end we ran a single iteration of 
both SEC and ISEC and measured the length of the best corresponding trial 
solution. Note that in the case of ISEC an iteration refers to the construction of 
a single ejection chain of maximal length. In the case of SEC an iteration may 
consist of many such such ejection chains; the algorithm only terminates when 
a local minima is reached. 
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Results 



Table [T] summarises the quality of computed solutions for the SEC, ISEC and 
fast-ISEC algorithms. We evaluate algorithmic performance by taking the length 
of each solution found and measuring the relative percentage deviation from the 
optimal solution. In each case we give figures for the best and average solutions 
found (columns Min and Mean respectively). We also provide average running 
times for each algorithm (in columns CPU) which are taken as wall-clock time 
and measured in seconds. Finally we evaluate the average quality of the starting 
tour (column STQ) as measured in multiples of the optimal solution length. 



Table 1. Quality of solutions (% above optimal) and CPU time (in seconds) for 
SEC, ISEC and fast-ISEC. 
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Looking at the Table [T] we see that with respect to the quahty of the average 
solution found ISEC outperforms SEC in 26 of the 30 problem instances. In 
particular, the tours found by ISEC are on average 18.44% closer to optimal. A 
similar observation holds if we look at the quality of minimum solutions found. 
As before, ISEC's performance dominates SEC - this time in 25 of the 30 problem 
instances - by an average of 14.4%. Similar results are seen in the case of fISEC 
however the average and minimum solution quality improvements over SEC are 
slightly lower at 15.89% and 13.56% respectively. As this is quite a coarse analysis 
we turn our attention to Figure |3][a) which shows the spread of results for mean 
solution quality. 



(a) Mean Solution Quality (b) Solution Quality (SQ) vs. Start Tour Quality (STQ) 




ISEC fISEC SEC 200 400 600 800 1000 1200 1400 

STQ (multiples of optimal tour length) 



Fig. 3. (a) Variability of mean solution quality (measured as % above optimal), 
(b) The effect of start tour quality on solution quality. 

Here we notice that the inter-quartile range of tours found by ISEC is just 
11% and includes no solutions more than 46% from optimal. Meanwhile SEC 
exhibits quite different behaviour; the inter-quartile range encompasses a very 
wide 39% range and includes tours which are up to 80% above optimal. To 
explore this effect we turn our attention to Figure |3|b) in which we measure the 
effect of start tour quality on solution quality. 

Using a simple least squares regression analysis we are able to see that the 
performance of both algorithms appears to be strongly correlated to the quality 
of the starting tour. In particular, notice that as the "distance" between the 
optimal tour and the start tour increases there is a corresponding drop in the 
mean quality of the best solution found. This suggests that such problems are 
"harder" than others, requiring the ejection of a greater number of edges before a 
good quality solution is found. As before, ISEC performs much better than SEC 
almost across the board. This is directly attributable to our informed heuristic 
function which, it appears, is able to find promising regions of the search space 
much more consistently than SEC's nearest- neighbour method. 
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Next, we analyse the search effort of SEC and ISEC which we measure in 
terms of nodes expanded and nodes generated. Looking at Figure |4] we notice 
that in the case of both metrics the performance of the two algorithms scales 
linearly with problem size. In the case of ISEC this is because the algorithm 
only constructs a single ejection chain; thus no more than 2n nodes can ever be 
expanded. By comparison, SEC tends to expand (and consequently generate) 
an order of magnitude more nodes. From this we may infer that on average 
approximately 10 ejections chains must be constructed before the algorithm 
reaches a local minima and terminates. 




Fig. 4. Total search effort (SEC vs. ISEC). 



An interesting dichotomy appears when we contrast Figure [4] with the corre- 
sponding CPU times from Table [T] We see that although ISEC generates much 
fewer nodes than SEC it is up to 2 orders of magnitude slower. We may attribute 
this behaviour to ISEC's evaluation function which must compute a 1-tree for 
every node generated. This effectively limits ISEC to solving much smaller in- 
stances than SEC. Our goal however is not to be able to compute solutions faster 
than SEC but rather to illustrate the power of informed heuristics for guiding 
ejection chain algorithms. To that end we believe ISEC's ability to find alter- 
native neighbourhoods which contain demonstrably higher quality tours makes 
it a promising alternative. Even so, we can report that the running time of all 
three algorithms can be improved by artificially limiting the maximum length of 
each ejection chain from 2n to n. This change has a minimal effect on solution 
quality (the same trends were observed) but halves the CPU times in Table [T] 



7 Conclusion 

We have shown that it is possible to generate Stem-and-Cycle ejection chains 
using an informed successor selection strategy. We develop the constrained 1- 
tree heuristic which computes a lower-bound on an apriori unknown goal tour 
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with respect to a set of edge inclusion and edge exclusion constraints. We show 
that this approach is both admissible and monotonic, never overestimating the 
cost to reach the goal tour. We combine the constrained 1-tree heuristic with 
the Subpath Ejection Chain (SEC) procedure to develop a new informed move 
operator which we term ISEC. We show that ISEC, using only a single ejection 
chain, consistently computes higher quality solutions to the TSP than SEC which 
generates many more ejection chains. Wc give complexity results for ISEC and 
show that although its worst-case theoretical running time is 0(4ri^ x Iog2(ri) + 
4n^) we are able to use it as a feasible alternative to SEC for TSP instances of 
up to 100 nodes. 

Future work involves applying ISEC to similar size TSPs featuring precedence 
and time window constraints. Often appearing as subproblcms in other areas of 
combinatorial optimisation (for example the Vehicle Routing Problem with Time 
Windows), these domains are interesting because the set of feasible solutions that 
may be found is much smaller than in the canonical TSP. 
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